NSF PAR Search | NSF Public Access Repository

Spectrally Transformed Kernel Regression

Zhai, Runtian; Pukdee, Rattana; Jin, Roger; Balcan, Maria-Florina; Ravikumar, Pradeep (May 2024, International Conference on Learning Representations (ICLR), 2024)

Unlabeled data is a key component of modern machine learning. In general, the role of unlabeled data is to impose a form of smoothness, usually from the similarity information encoded in a base kernel, such as the ε-neighbor kernel or the adjacency matrix of a graph. This work revisits the classical idea of spectrally transformed kernel regression (STKR), and provides a new class of general and scalable STKR estimators able to leverage unlabeled data. Intuitively, via spectral transformation, STKR exploits the data distribution for which unlabeled data can provide additional information. First, we show that STKR is a principled and general approach, by characterizing a universal type of “target smoothness”, and proving that any sufficiently smooth function can be learned by STKR. Second, we provide scalable STKR implementations for the inductive setting and a general transformation function, while prior work is mostly limited to the transductive setting. Third, we derive statistical guarantees for two scenarios: STKR with a known polynomial transformation, and STKR with kernel PCA when the transformation is unknown. Overall, we believe that this work helps deepen our understanding of how to work with unlabeled data, and its generality makes it easier to inspire new methods.
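To make the idea of a spectral transformation concrete, here is a minimal transductive sketch in Python. It is an illustration under stated assumptions, not the scalable inductive estimator the abstract describes: the RBF base kernel, the 1/n normalization of the Gram matrix, the ridge value, and the function names `rbf_gram`, `poly_spectral_transform`, and `stkr_transductive` are all hypothetical choices. It applies a polynomial s(λ) = Σ_p c_p λ^p to the spectrum of the base Gram matrix computed over labeled and unlabeled points together, then runs kernel ridge regression on the labeled points only.

```python
import numpy as np

def rbf_gram(X, gamma=1.0):
    """Base kernel (an assumed choice): RBF Gram matrix over all points."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-gamma * np.maximum(d2, 0.0))

def poly_spectral_transform(G, coeffs):
    """Return sum_p coeffs[p-1] * (G/n)^p.

    This applies s(lam) = sum_p coeffs[p-1] * lam^p to every eigenvalue
    of the normalized Gram matrix G/n while keeping its eigenvectors.
    """
    n = G.shape[0]
    A = G / n
    out = np.zeros_like(A)
    power = np.eye(n)
    for c in coeffs:
        power = power @ A          # next power of the normalized Gram matrix
        out += c * power
    return out

def stkr_transductive(X, y_labeled, labeled_idx, coeffs, ridge=1e-3):
    """Kernel ridge regression with the transformed kernel.

    X holds labeled and unlabeled points; only the rows in labeled_idx
    have targets. Predictions are returned for every row of X.
    """
    Ks = poly_spectral_transform(rbf_gram(X), coeffs)
    K_ll = Ks[np.ix_(labeled_idx, labeled_idx)]
    alpha = np.linalg.solve(K_ll + ridge * np.eye(len(labeled_idx)), y_labeled)
    return Ks[:, labeled_idx] @ alpha

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(20, 2))
    labeled_idx = np.arange(5)          # first 5 points are labeled
    y = np.sin(X[labeled_idx, 0])
    print(stkr_transductive(X, y, labeled_idx, coeffs=[0.5, 0.3, 0.2]))
```

The unlabeled points influence the predictions only through the matrix powers, which mix similarity information across the whole sample; with `coeffs=[1.0]` the sketch reduces to ordinary kernel ridge regression on the (rescaled) base kernel.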
Full Text Available
Reliable learning in challenging environments

Balcan, Maria-Florina; Hanneke, Steve; Pukdee, Rattana; Sharma, Dravyansh (September 2023, Thirty-seventh Conference on Neural Information Processing Systems)

Full Text Available
Label Propagation with Weak Supervision

Pukdee, Rattana; Sam, Dylan; Balcan, Maria-Florina; Ravikumar, Pradeep (May 2023, International Conference on Learning Representations)

In weakly supervised learning, we aim to reduce the growing demand for labeled data in current machine learning applications. In this paper, we introduce a novel analysis of the classical label propagation algorithm (LPA) (Zhu & Ghahramani, 2002) that takes advantage of useful prior information, specifically probabilistic hypothesized labels on the unlabeled data. We provide an error bound that exploits both the local geometric properties of the underlying graph and the quality of the prior information. We also propose a framework to incorporate multiple sources of noisy information. In particular, we consider the setting of weak supervision, where our sources of information are weak labelers. We demonstrate the ability of our approach on multiple benchmark weakly supervised classification tasks, showing improvements upon existing semi-supervised and weakly supervised methods.
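For readers unfamiliar with the classical label propagation algorithm that the abstract builds on, the following is a minimal Python sketch of the basic iteration only. It does not implement the paper's prior-aware analysis or its weak-supervision framework. The function name `label_propagation`, the fixed iteration count, and the neutral starting score of 0.5 for unlabeled nodes are assumptions made for illustration; the graph is assumed to have no isolated nodes.

```python
import numpy as np

def label_propagation(W, labels, labeled_mask, n_iter=200):
    """Classical label propagation on a weighted graph.

    W            : (n, n) symmetric nonnegative weight matrix, no zero rows
    labels       : (n,) scores in [0, 1]; entries at unlabeled nodes are ignored
    labeled_mask : (n,) boolean, True where the label is known
    Each step replaces every score by the weighted average of its
    neighbors' scores, then clamps the labeled nodes to their labels.
    """
    W = np.asarray(W, dtype=float)
    labels = np.asarray(labels, dtype=float)
    labeled_mask = np.asarray(labeled_mask, dtype=bool)
    degree = W.sum(axis=1)
    f = np.where(labeled_mask, labels, 0.5)   # neutral start for unlabeled nodes
    for _ in range(n_iter):
        averaged = (W @ f) / degree           # neighbor averaging
        f = np.where(labeled_mask, labels, averaged)
    return f

if __name__ == "__main__":
    # Path graph 0 - 1 - 2 - 3 with the two endpoints labeled.
    W = np.array([[0, 1, 0, 0],
                  [1, 0, 1, 0],
                  [0, 1, 0, 1],
                  [0, 0, 1, 0]])
    scores = label_propagation(W, [1, 0, 0, 0], [True, False, False, True])
    print(scores)
```

On the path graph in the usage example, the interior scores settle at values that interpolate between the two clamped endpoints, which is the smoothness behavior the graph structure is meant to impose.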
Full Text Available
Label Propagation with Weak Supervision

Pukdee, Rattana; Sam, Dylan; Balcan, Maria-Florina; Ravikumar, Pradeep (May 2023, International Conference on Learning Representations (ICLR))

Semi-supervised learning and weakly supervised learning are important paradigms that aim to reduce the growing demand for labeled data in current machine learning applications. In this paper, we introduce a novel analysis of the classical label propagation algorithm (LPA) (Zhu & Ghahramani, 2002) that additionally takes advantage of useful prior information, specifically probabilistic hypothesized labels on the unlabeled data. We provide an error bound that exploits both the local geometric properties of the underlying graph and the quality of the prior information. We also propose a framework to incorporate multiple sources of noisy information. In particular, we consider the setting of weak supervision, where our sources of information are weak labelers. We demonstrate the effectiveness of our approach on multiple benchmark weakly supervised classification tasks, showing improvements upon existing semi-supervised and weakly supervised methods.
Full Text Available
Label Propagation with Weak Supervision

Pukdee, Rattana; Sam, Dylan; Balcan, Maria-Florina; Ravikumar, Pradeep (May 2023, The Eleventh International Conference on Learning Representations (ICLR))

Full Text Available
Nash Equilibria and Pitfalls of Adversarial Training in Adversarial Robustness Games

Balcan, Maria-Florina; Pukdee, Rattana; Ravikumar, Pradeep; Zhang, Hongyang (April 2023, Proceedings of The 26th International Conference on Artificial Intelligence and Statistics)

Full Text Available